

Conservative Offline Policy Adaptation in Multi-Agent Games

Neural Information Processing Systems

Prior research on policy adaptation in multi-agent games has often relied on online interaction with the target agent during training, which can be expensive and impractical in real-world scenarios. Inspired by recent progress in offline reinforcement learning, this paper studies offline policy adaptation, which aims to utilize the target agent's behavior data to exploit its weaknesses or enable effective cooperation. We investigate its distinct challenges of distributional shift and risk-free deviation, and propose a novel learning objective, conservative offline adaptation, that optimizes the worst-case performance against any dataset-consistent proxy model. We propose an efficient algorithm called Constrained Self-Play (CSP) that incorporates dataset information into regularized policy learning. We prove that CSP learns a near-optimal, risk-free offline adaptation policy upon convergence. Empirical results demonstrate that CSP outperforms non-conservative baselines in various environments, including Maze, predator-prey, MuJoCo, and Google Football.
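The worst-case objective can be made concrete with a toy sketch. Everything below (the payoff matrix, the two-state setup, and the function name) is invented for illustration and is not from the paper: where the dataset pins down the target's behavior we best-respond to it, and where the data is silent, any dataset-consistent proxy is possible, so a conservative policy falls back to the maximin action.

```python
import numpy as np

# Hypothetical payoff for our agent: rows = our actions, cols = target's actions.
PAYOFF = np.array([[3.0, -2.0],
                   [1.0,  1.0]])

def conservative_response(observed_target_action=None):
    """Best-respond where the dataset pins down the target's action;
    elsewhere, any dataset-consistent proxy is possible, so play maximin."""
    if observed_target_action is not None:
        return int(np.argmax(PAYOFF[:, observed_target_action]))
    # Off-dataset: optimize the worst case over all possible target actions.
    return int(np.argmax(PAYOFF.min(axis=1)))

# The dataset covers one state (target plays action 0); another state is unseen.
on_data = conservative_response(observed_target_action=0)  # best response: action 0
off_data = conservative_response()                         # maximin fallback: action 1
```

Note how the risky action 0 (payoff 3 or -2) is only chosen when the data supports it; off-dataset, the safe action 1 guarantees a payoff of 1 against any proxy.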


Finding Friend and Foe in Multi-Agent Games

Neural Information Processing Systems

AI for multi-agent games like Go, Poker, and Dota has made great strides in recent years. Yet none of these games addresses the real-life challenge of cooperation in the presence of unknown and uncertain teammates. This challenge is a key game mechanism in hidden role games. Here we develop the DeepRole algorithm, a multi-agent reinforcement learning agent that we test on The Resistance: Avalon, the most popular hidden role game. DeepRole combines counterfactual regret minimization (CFR) with deep value networks trained through self-play.
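CFR builds on regret matching. As a minimal, self-contained sketch (using rock-paper-scissors rather than Avalon, and not the paper's actual implementation), self-play regret matching in a zero-sum matrix game drives the *average* strategies toward a Nash equilibrium:

```python
import numpy as np

# Row player's payoff in rock-paper-scissors (zero-sum: column player gets the negative).
PAYOFF = np.array([[0., -1.,  1.],
                   [1.,  0., -1.],
                   [-1., 1.,  0.]])

def regret_matching(payoff, iterations=20000, seed=0):
    """Self-play regret matching; the average strategies approach a Nash equilibrium."""
    rng = np.random.default_rng(seed)
    n = payoff.shape[0]
    regrets = [np.zeros(n), np.zeros(n)]
    strategy_sums = [np.zeros(n), np.zeros(n)]
    for _ in range(iterations):
        # Current strategies: play proportionally to positive cumulative regret.
        strategies = []
        for r in regrets:
            pos = np.maximum(r, 0.0)
            strategies.append(pos / pos.sum() if pos.sum() > 0 else np.full(n, 1.0 / n))
        a0 = rng.choice(n, p=strategies[0])
        a1 = rng.choice(n, p=strategies[1])
        # Regret of each pure action versus the action actually played.
        regrets[0] += payoff[:, a1] - payoff[a0, a1]
        regrets[1] += -payOFF[a0, :] + payoff[a0, a1] if False else -payoff[a0, :] + payoff[a0, a1]
        strategy_sums[0] += strategies[0]
        strategy_sums[1] += strategies[1]
    return [s / s.sum() for s in strategy_sums]

avg_row, avg_col = regret_matching(PAYOFF)  # both approach (1/3, 1/3, 1/3)
```

DeepRole's contribution lies in extending this style of counterfactual reasoning beyond two players and in replacing tabular values with deep value networks; the sketch above only shows the underlying regret-matching dynamic.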


Emergence of Language with Multi-agent Games: Learning to Communicate with Sequences of Symbols

Neural Information Processing Systems

Learning to communicate through interaction, rather than relying on explicit supervision, is often considered a prerequisite for developing a general AI. We study a setting where two agents engage in playing a referential game and, from scratch, develop the communication protocol necessary to succeed in this game. Unlike previous work, we require that the messages they exchange, both at train and test time, are in the form of a language (i.e.


Reviews: Finding Friend and Foe in Multi-Agent Games

Neural Information Processing Systems

The paper builds on well-known methods (CFR) and provides novel improvements and modifications that extend the approach to a multiplayer, hidden-role setting. This is original and creative, though the crucial role of CFR cannot be overstated. Related work appears to be adequately cited. The empirical results provide the main validation for the soundness and quality of the proposed algorithm; this is reasonable and is explained well in the paper. I have not spotted any obvious logical errors or mistakes.


Reviews: Finding Friend and Foe in Multi-Agent Games

Neural Information Processing Systems

All reviewers agree that the paper provides some nice contributions (extending CFR beyond two players and tackling Avalon) and that the authors' rebuttal successfully addresses the major concerns raised by the referees. They have responded adequately and, furthermore, open-sourced their implementation. We expect the authors, though, to carry out the promised changes (and also to improve the notation).




Reviews: Emergence of Language with Multi-agent Games: Learning to Communicate with Sequences of Symbols

Neural Information Processing Systems

Increasing my score based on the authors' rebuttal. The argument that the proposed method can complement human-bot training makes sense. Also, the RL baseline experiments appear to be exhaustive. However, the claim that the learnt language is compositional should be toned down, since there is not enough evidence to support it. Old reviews: The paper proposes to use Gumbel-softmax for training sender and receiver agents in a referential game like Lazaridou (2016).
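The Gumbel-softmax trick the review refers to replaces a non-differentiable symbol draw with a temperature-controlled relaxation, so gradients can flow from receiver to sender. A minimal numpy sketch (the logits and symbol vocabulary below are made up for illustration; a real sender would produce logits from a neural network):

```python
import numpy as np

def gumbel_softmax(logits, temperature, rng):
    """One relaxed (differentiable) sample from a categorical distribution."""
    gumbels = -np.log(-np.log(rng.uniform(size=logits.shape)))  # Gumbel(0, 1) noise
    scores = (logits + gumbels) / temperature
    scores -= scores.max()                                      # numerical stability
    exp_scores = np.exp(scores)
    return exp_scores / exp_scores.sum()

rng = np.random.default_rng(0)
logits = np.array([2.0, 0.5, 0.1])  # hypothetical unnormalized symbol scores
soft_symbol = gumbel_softmax(logits, temperature=0.5, rng=rng)

# The argmax of a Gumbel-perturbed draw is distributed as softmax(logits),
# so empirical symbol frequencies match the sender's underlying policy.
counts = np.bincount(
    [int(np.argmax(gumbel_softmax(logits, 0.5, rng))) for _ in range(2000)],
    minlength=3,
)
```

Lower temperatures push each relaxed sample toward a one-hot vector (a discrete symbol), at the cost of higher-variance gradients; this trade-off is what makes the trick suitable for learning sequences of symbols end-to-end.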


Towards Distraction-Robust Active Visual Tracking

Zhong, Fangwei, Sun, Peng, Luo, Wenhan, Yan, Tingyun, Wang, Yizhou

arXiv.org Artificial Intelligence

In active visual tracking, it is notoriously difficult when distracting objects appear, as distractors often mislead the tracker by occluding the target or bringing a confusing appearance. To address this issue, we propose a mixed cooperative-competitive multi-agent game, where a target and multiple distractors form a collaborative team to play against a tracker and make it fail to follow. Through learning in our game, diverse distracting behaviors of the distractors naturally emerge, thereby exposing the tracker's weaknesses, which helps enhance the distraction-robustness of the tracker. For effective learning, we then present a set of practical methods, including a reward function for distractors, a cross-modal teacher-student learning strategy, and a recurrent attention mechanism for the tracker. The experimental results show that our tracker achieves the desired distraction-robust active visual tracking and generalizes well to unseen environments. We also show that the multi-agent game can be used to adversarially test the robustness of trackers.


DeepMind Wants to Reimagine One of the Most Important Algorithms in Machine Learning

#artificialintelligence

I recently started an AI-focused educational newsletter that already has over 80,000 subscribers. TheSequence is a no-BS (no hype, no news, etc.) ML-oriented newsletter that takes 5 minutes to read. The goal is to keep you up to date with machine learning projects, research papers, and concepts. Principal component analysis (PCA) is one of the key algorithms in any machine learning curriculum. Initially created in the early 1900s, PCA is a fundamental algorithm for understanding data in high-dimensional spaces, which are common in deep learning problems.
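For readers who have not seen PCA stated in code, a minimal sketch of the classical eigendecomposition formulation (this is the textbook algorithm, not whatever reformulation DeepMind proposes; the synthetic data below is made up for illustration):

```python
import numpy as np

def pca(X, k):
    """Project X onto its top-k principal components via eigendecomposition."""
    Xc = X - X.mean(axis=0)                  # center each feature
    cov = Xc.T @ Xc / (len(Xc) - 1)          # sample covariance matrix
    eigvals, eigvecs = np.linalg.eigh(cov)   # eigh returns ascending eigenvalues
    top = np.argsort(eigvals)[::-1][:k]      # indices of the k largest
    return Xc @ eigvecs[:, top], eigvals[top]

# Synthetic 2-D data stretched along the first axis.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 2)) * np.array([5.0, 0.5])
Z, explained = pca(X, k=1)  # Z: projected data; explained: variance captured
```

The variance of each projected coordinate equals the corresponding eigenvalue, which is exactly the "explained variance" that PCA ranks components by.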